Can we really measure $f_{\rm NL}$ from the galaxy power
spectrum?

ABSTRACT

The scale-dependent galaxy bias generated by primordial non-Gaussianity (PNG) can
be used to detect and constrain deviations from standard single-field inflation. The
strongest signal is expected in the local model for PNG, where the amplitude of
non-Gaussianity can be expressed by a set of parameters ($f_{\rm NL}$, $g_{\rm NL}$, ...).
Current observational constraints from galaxy clustering on $f_{\rm NL}$ and $g_{\rm NL}$
assume that the other PNG parameters vanish. Using two sets of cosmological N-body
simulations where both $f_{\rm NL}$ and $g_{\rm NL}$ are non-zero, we show that this strong
assumption generally leads to biased estimates and spurious redshift dependencies of
the parameters. Additionally, if the signs of $f_{\rm NL}$ and $g_{\rm NL}$ are opposite, the
amplitude of the scale-dependent bias is reduced, possibly leading to a false null
detection. Finally, we show that model selection techniques like the Bayesian evidence
can (and should) be used to determine if more than one PNG parameter is required by
the data.

Key words: methods: numerical - cosmology: cosmological parameters - large-scale 
structure of Universe 



1 INTRODUCTION 

Standard single-field inflation scenarios predict primordial
perturbations with nearly Gaussian initial conditions. In
fact, the tiny non-Gaussian features arising from the
self-coupling of the inflaton field would remain undetectable.
Any evidence for primordial non-Gaussianity (PNG) would
then imply the emergence of non-standard physics in the
early universe (for a review see e.g. Bartolo et al. 2004).

Here we consider the local model of PNG, in which the 
Bardeen curvature potential during matter domination (at 
high-redshift) is given by 



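$$\Phi(\mathbf{x}) = \phi(\mathbf{x}) + f_{\rm NL}\left[\phi^2(\mathbf{x}) - \langle\phi^2\rangle\right] + g_{\rm NL}\left[\phi^3(\mathbf{x}) - 3\langle\phi^2\rangle\,\phi(\mathbf{x})\right] \qquad (1)$$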



where $\phi(\mathbf{x})$ is a zero-mean Gaussian random field, $f_{\rm NL}$ and
$g_{\rm NL}$ are real constants, and the symbol $\langle\cdot\rangle$ denotes
expectation values. On large scales, PNG of the local type introduces
a scale-dependent bias between galaxies and the underlying
matter distribution (Dalal et al. 2008; Matarrese & Verde 2008;
Slosar et al. 2008; Giannantonio & Porciani 2010).
Several authors have derived observational constraints on
$f_{\rm NL}$ and $g_{\rm NL}$ from the galaxy power spectrum or 2-point
correlation function (Slosar et al. 2008; Desjacques & Seljak 2010),
and recently Xia et al. (2011) reported a possible detection
of positive $f_{\rm NL}$ at the $2\sigma$ level. However, these studies
all assume that the underlying model for PNG is either
purely quadratic ($g_{\rm NL} = 0$) or purely cubic ($f_{\rm NL} = 0$). In
this Letter, we show that the assumption of a one-parameter
model significantly biases the estimation of the PNG parameters
when both $f_{\rm NL}$ and $g_{\rm NL}$ are non-zero. We also
illustrate that model-selection techniques like the Bayes factor
are powerful tools to identify the model that best describes
the available data. This is particularly interesting in
light of the next generation of galaxy and cluster surveys
like eROSITA (Predehl et al. 2010), Euclid (Laureijs et al. 2011)
or LSST (Ivezić et al. 2008), which are expected to
significantly reduce the error bars of PNG parameters (e.g.
Pillepich et al. 2012; Giannantonio et al. 2012).
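
As an illustration of how the two terms of equation (1) act on a Gaussian field, the following minimal sketch applies the local transformation to a toy white-noise field. The function names `local_png` and `skewness`, the unit-variance field and the deliberately exaggerated couplings (0.05 and 0.01) are assumptions made only so that the effect stands out above sampling noise; they are not the values used in the simulations of this Letter.

```python
import numpy as np

def local_png(phi, f_nl, g_nl):
    """Local-type transformation of equation (1) applied to a Gaussian field phi."""
    var = np.mean(phi**2)                      # sample estimate of <phi^2>
    return phi + f_nl * (phi**2 - var) + g_nl * (phi**3 - 3.0 * var * phi)

def skewness(x):
    """Dimensionless third moment of a sample."""
    d = x - x.mean()
    return np.mean(d**3) / np.mean(d**2) ** 1.5

rng = np.random.default_rng(0)
phi = rng.normal(0.0, 1.0, size=200_000)       # toy Gaussian field (assumed unit variance)

Phi_quad = local_png(phi, f_nl=0.05, g_nl=0.0)   # quadratic term only
Phi_cubic = local_png(phi, f_nl=0.0, g_nl=0.01)  # cubic term only

print("skewness, quadratic model:", skewness(Phi_quad))
print("skewness, cubic model:    ", skewness(Phi_cubic))
```

In this toy setting the quadratic term generates a skewness of order $6 f_{\rm NL}$ (for a unit-variance field), whereas the cubic term is odd in $\phi$ and leaves the skewness consistent with zero, affecting the kurtosis instead.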



2 SCALE-DEPENDENT BIAS AND PNG 

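The linear matter density contrast in Fourier space, $\delta_{\rm m}(\mathbf{k})$,
follows from the curvature perturbations by

$$\delta_{\rm m}(\mathbf{k}, z) = \alpha(k, z)\,\Phi(\mathbf{k}) \qquad (2)$$

with

$$\alpha(k, z) = \frac{2\,k^2\,D(z)\,T(k)}{3\,\Omega_{\rm m}\,H_0^2} \qquad (3)$$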
where $\Omega_{\rm m}$ and $H_0$ are the current matter-density parameter
and Hubble constant, $T(k)$ is the transfer function, and $D(z)$
is the growth factor normalised to $(1+z)^{-1}$ during the epoch
of matter domination.
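
For concreteness, equation (3) can be evaluated numerically as in the sketch below. It rests on several stated assumptions: wavenumbers are in $h\,{\rm Mpc}^{-1}$, the factor $c^2$ that is implicit in equation (3) is restored explicitly, and the default transfer function ($T = 1$) and growth factor ($D = 1/(1+z)$, its matter-domination form) are crude placeholders to be replaced by proper cosmological functions. The function name `alpha` and its keyword arguments are illustrative only.

```python
import numpy as np

C_KMS = 299792.458  # speed of light in km/s

def alpha(k, z, omega_m=0.279,
          transfer=lambda k: np.ones_like(k),      # placeholder: T(k) = 1 (large scales)
          growth=lambda z: 1.0 / (1.0 + z)):       # placeholder: matter-domination growth
    """alpha(k, z) of equation (3) for k in h/Mpc, with c restored explicitly."""
    k = np.asarray(k, dtype=float)
    h0_over_c = 100.0 / C_KMS                      # H_0 / c in h/Mpc
    return 2.0 * k**2 * transfer(k) * growth(z) / (3.0 * omega_m * h0_over_c**2)

k = np.array([0.007, 0.014, 0.04])
print(alpha(k, z=0.0))
```

Because $T(k) \to 1$ on large scales, $\alpha$ grows as $k^2$ there, which is the origin of the $k^{-2}$ behaviour of the scale-dependent bias.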

Non-linear structures form with time via gravitational
instability. Let us denote by $P_{\rm m}(k)$ the matter power spectrum
and by $P_{\rm hm}(k)$ the cross spectrum between dark-matter
haloes (of a given mass) and mass. On large scales,
the relation between $P_{\rm hm}(k)$ and $P_{\rm m}(k)$ is given by

$$\frac{P_{\rm hm}(k)}{P_{\rm m}(k)} = b_1(f_{\rm NL}, g_{\rm NL}) + \Delta b(k, f_{\rm NL}, g_{\rm NL}) \qquad (4)$$



where $b_1$ is the linear bias parameter (a real-valued
constant) and $\Delta b(k, f_{\rm NL}, g_{\rm NL})$ is a scale-dependent bias
term introduced by PNG. Following the work of
Smith, Ferraro & LoVerde (2012, hereafter SFL12), the
latter term can be split up into different contributions (to
first order in $f_{\rm NL}$ and $g_{\rm NL}$):

$$\Delta b(k, f_{\rm NL}, g_{\rm NL}) = \frac{\beta_f\,f_{\rm NL} + \beta_g\,g_{\rm NL}}{\alpha(k)} \qquad (5)$$
with

$$\beta_f = 2\,(\nu^2 - 1) \qquad (6)$$

$$\nu = \left[\delta_{\rm c}\,(b_1 - 1) + 1\right]^{1/2} \qquad (7)$$



where $\delta_{\rm c}$ is a parameter set to $\simeq 1.4$ to fit the clustering
properties of friends-of-friends haloes in numerical simulations.
Note that $b_1$ modulates the amplitude of the scale-dependent
bias through $\nu$. Similarly, the dependence on
$g_{\rm NL}$ is (SFL12)



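$$\beta_g = \kappa_3^{(1)}(M)\left[-0.7 + 1.4\,(\nu - 1)^2 + 0.6\,(\nu - 1)^3\right] - \frac{{\rm d}\kappa_3^{(1)}(M)}{{\rm d}\log\sigma^{-1}}\left(\frac{\nu - \nu^{-1}}{2}\right) \qquad (8)$$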



with $\kappa_3^{(1)}$ the skewness of the mass density field (smoothed
on the mass scale $M$) in the case of $f_{\rm NL} = 1$ and $\sigma$ the linear
rms mass-density fluctuation on the scale $M$. The skewness
and its derivative can be expressed in terms of $b_1$ and $z$ (see
equations 46 and 47 in SFL12).

In brief, PNG of the local type makes the bias of dark-matter
haloes scale dependent and proportional to $k^{-2}$ for
$k \to 0$. It is therefore conceivable to constrain PNG from
measurements of the galaxy power spectrum, provided that a
model linking galaxies to haloes is available. However, the $f_{\rm NL}$ and
$g_{\rm NL}$ contributions to the scale-dependent bias in equation
(5) show the same scaling with the wavenumber, so it is
not possible to measure both parameters from studies of
a single population of tracers of the large-scale structure.
Given that $\beta_f$ and $\beta_g$ display different scaling with halo
mass and redshift, this degeneracy can be broken by considering
several populations of galaxies, possibly within widely
separated redshift bins. Recent studies have accomplished
this task by combining a number of observational datasets
with simple models where either $f_{\rm NL}$ or $g_{\rm NL}$ is set to zero a
priori. With the help of numerical simulations, in this Letter
we study the effect of these simplifying assumptions on the
determination of PNG.

^1 Some authors write $\nu$ in terms of the linear-bias factor obtained
from Gaussian initial conditions, but Giannantonio & Porciani
(2010) have demonstrated that the full $b_1$ should appear in
equation (7). Note, however, that the correction to $b_1$ due to PNG
is generally small and, in most practical cases, the difference
between the two formulations is negligible.
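
The degeneracy described in this section can be made explicit with a short numerical sketch of equations (5)-(7). The value of `beta_g` and the amplitude of the toy $\alpha(k) \propto k^2$ are hypothetical placeholders (in practice $\beta_g$ follows from equation 8 and $\alpha$ from equation 3), and the function names are illustrative only.

```python
import numpy as np

DELTA_C = 1.42  # value adopted in this Letter for friends-of-friends haloes

def beta_f(b1, delta_c=DELTA_C):
    """Equations (6)-(7): beta_f = 2 (nu^2 - 1) with nu^2 = delta_c (b1 - 1) + 1."""
    nu2 = delta_c * (b1 - 1.0) + 1.0
    return 2.0 * (nu2 - 1.0)

def delta_b(alpha_k, b1, f_nl, g_nl, beta_g):
    """Scale-dependent bias of equation (5); beta_g must be supplied by the caller."""
    return (beta_f(b1) * f_nl + beta_g * g_nl) / alpha_k

k = np.linspace(0.007, 0.04, 8)     # wavenumbers in h/Mpc
alpha_k = 2.0e7 * k**2              # toy alpha(k) ~ k^2 (assumed amplitude, T = D = 1)
b1, beta_g = 2.0, 2.0e-4            # hypothetical tracer properties

# Two-parameter model versus a purely quadratic model with a shifted f_NL
both = delta_b(alpha_k, b1, 50.0, 5.0e5, beta_g)
f_eff = 50.0 + beta_g * 5.0e5 / beta_f(b1)
quad_only = delta_b(alpha_k, b1, f_eff, 0.0, beta_g)

# Opposite signs: the g_NL value that cancels the f_NL = 50 signal for this tracer
g_cancel = -beta_f(b1) * 50.0 / beta_g
cancelled = delta_b(alpha_k, b1, 50.0, g_cancel, beta_g)

print("max |both - quad_only| =", np.max(np.abs(both - quad_only)))
print("max |cancelled|        =", np.max(np.abs(cancelled)))
```

For a single tracer the two curves coincide at every $k$, so only the combination $\beta_f f_{\rm NL} + \beta_g g_{\rm NL}$ is measurable; tracers with a different ratio $\beta_g/\beta_f$ are needed to separate the two parameters.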



3 SIMULATIONS 



We have used Gadget-2 (Springel 2005) to perform two sets
of dark-matter-only simulations consisting of four realisations
each. Each set corresponds to a different fiducial model
of PNG assuming $f_{\rm NL} = 50$ and $g_{\rm NL} = \pm 5 \times 10^5$, in
accordance with upper limits obtained from
cosmic-microwave-background studies (Komatsu et al. 2011; Vielva & Sanz
2010). The other cosmological parameters are chosen in
concordance with the WMAP5 analysis (Komatsu et al. 2009):
$\Omega_{\rm m} = 0.279$, $\Omega_\Lambda = 0.721$, $\Omega_{\rm b} = 0.0462$, $\sigma_8 = 0.817$, $h = 0.7$,
$n_{\rm s} = 0.96$. We follow $1024^3$ particles within a cubic box of
side $L = 1200\,h^{-1}$ Mpc, corresponding to a particle mass of
$1.247 \times 10^{11}\,h^{-1} M_\odot$. The initial conditions are generated at
redshift 50 using the Zel'dovich approximation to displace
particles from a regular Cartesian grid.
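
The quoted particle mass follows directly from the box size, particle number and matter density. The sketch below is a consistency check under two assumptions: the box side is expressed in $h^{-1}$ Mpc, and the critical density is taken as $2.775 \times 10^{11}\,h^2 M_\odot\,{\rm Mpc}^{-3}$. The function name `particle_mass` is illustrative.

```python
RHO_CRIT = 2.775e11   # critical density today in h^2 M_sun Mpc^-3

def particle_mass(omega_m, box, n_side):
    """Mass per particle (h^-1 M_sun) for n_side^3 particles in a box of side `box` (h^-1 Mpc)."""
    return omega_m * RHO_CRIT * box**3 / n_side**3

m_p = particle_mass(omega_m=0.279, box=1200.0, n_side=1024)
print(f"particle mass = {m_p:.3e} h^-1 M_sun")
```

With these inputs the result agrees with the quoted $1.247 \times 10^{11}\,h^{-1} M_\odot$ to within a fraction of a per cent, the small residual reflecting the rounding of the critical density.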

Haloes are selected using a friends-of-friends halo finder
with a linking length of 0.2 times the mean interparticle
separation. We consider only objects with at least
80 particles, so the halo mass range is
$10^{13}\,h^{-1} M_\odot < M_{\rm halo} < 2 \times 10^{15}\,h^{-1} M_\odot$ at $z = 0$. We use outputs at
$z = \{0, 0.5, 1, 1.5, 2\}$. The scale-dependent bias parameter
$b(k)$ for the haloes is measured from the ratio of the
($k$-binned) halo-matter cross spectrum and the matter auto
spectrum, averaged over the four realisations. We only
consider the range $0.007\,h\,{\rm Mpc}^{-1} < k < 0.04\,h\,{\rm Mpc}^{-1}$ where
equation (8) holds as shown in SFL12. The errors on $b(k)$
are calculated using an estimator derived in the appendix of
Smith & LoVerde (2011).
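
The bias measurement described above, a ratio of the binned halo-matter cross spectrum to the binned matter auto spectrum, can be sketched with FFTs on gridded density fields. This is not the pipeline used for the simulations: the function name `binned_bias`, the toy white-noise fields and the grid size are assumptions, and mass-assignment corrections and shot noise are ignored.

```python
import numpy as np

def binned_bias(delta_h, delta_m, box, k_edges):
    """b(k) = P_hm(k) / P_m(k) in bins of |k| for two cubic density grids."""
    n = delta_m.shape[0]
    fm = np.fft.rfftn(delta_m)
    fh = np.fft.rfftn(delta_h)
    kf = 2.0 * np.pi / box                                   # fundamental mode
    k_full = np.fft.fftfreq(n, d=1.0 / n) * kf               # integer modes times kf
    k_half = np.fft.rfftfreq(n, d=1.0 / n) * kf
    kmag = np.sqrt(k_full[:, None, None]**2
                   + k_full[None, :, None]**2
                   + k_half[None, None, :]**2).ravel()
    cross = (fh * np.conj(fm)).real.ravel()                  # un-normalised P_hm
    auto = (np.abs(fm)**2).ravel()                           # un-normalised P_m
    idx = np.digitize(kmag, k_edges) - 1
    n_bins = len(k_edges) - 1
    good = (idx >= 0) & (idx < n_bins)
    p_hm = np.bincount(idx[good], weights=cross[good], minlength=n_bins)
    p_m = np.bincount(idx[good], weights=auto[good], minlength=n_bins)
    return p_hm / p_m                                        # normalisations cancel in the ratio

rng = np.random.default_rng(1)
n, box = 32, 1200.0                                          # toy grid; box in Mpc/h
delta_m = rng.normal(size=(n, n, n))                         # toy matter field
delta_h = 2.0 * delta_m + 0.5 * rng.normal(size=(n, n, n))   # toy haloes: b = 2 plus noise
k_edges = np.linspace(0.007, 0.04, 5)

print(binned_bias(delta_h, delta_m, box, k_edges))
```

Averaging over realisations then amounts to taking the mean of the returned arrays, e.g. with `np.mean` over a list of four such measurements.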

To make sure that equations (5)-(8) (with $\delta_{\rm c} = 1.42$)
can also be applied to our simulations, we fix $f_{\rm NL}$ and $g_{\rm NL}$
to the input values and use a standard $\chi^2$ fit to measure $b_1$
as a function of halo mass and redshift. We find good $\chi^2$
values in all cases, and the fitted $b_1$ is compatible with the
value expected for the case of Gaussian initial conditions
(see equation 14 from Pillepich et al. 2010). We can therefore
safely assume that the fitting formulae given by SFL12
accurately describe the scale-dependent bias for our sets of
simulations.



4 PARAMETER ESTIMATION 

We employ the simulations to build mock observational data
for $b(k)$ and use them to estimate the PNG parameters under
different model assumptions. We consider two sets of halo
mass and redshift bins as described in Table 1. The first
set is chosen to closely resemble the observations used by
Xia et al. (2011). The second set covers a wider range in
redshift and linear bias parameter $b_1$ and exemplifies datasets
(for groups and clusters of galaxies) that will be available in
the future.

Figure 1. Credibility intervals (68.3 and 95.4 per cent) for $f_{\rm NL}$ and $g_{\rm NL}$. Solid: positive $g_{\rm NL}$; dotted: negative $g_{\rm NL}$. The black crosses
indicate the input values for $f_{\rm NL}$ and $g_{\rm NL}$. Left panel: using the current sample; right panel: using the future sample.



Table 1. The top section describes the galaxy samples used in
the analysis of Xia et al. (2011): redshift, type, effective linear bias
parameter $b_{\rm eff}$ and number of objects $N_{\rm gal}$. The middle section
illustrates the properties of specific halo subsamples from our
simulations that have been selected to approximate the observational
dataset described above: redshift, mass range (in $10^{13}\,h^{-1} M_\odot$),
linear bias parameter $b_1$, and number of objects in each mass bin,
$N_{\rm halo}$, after combining the four realisations. We denote this as
the "current sample". The bottom section refers to a hypothetical
sample of more massive objects that should become available
in the near future. We label this the "future sample".

4.1 Two-parameter model for PNG

We first consider a model ($M_2$) with three free parameters,
$f_{\rm NL}$, $g_{\rm NL}$ and $b_1$, and assume Gaussian errors on $b(k)$ to
build their likelihood function. We adopt a flat prior within
a finite region of parameter space (considerably more extended
than the likelihood function) and marginalize the
posterior distribution over $b_1$. Fig. 1 shows the marginal
posterior probability density for $f_{\rm NL}$ and $g_{\rm NL}$ by marking the
68.3 and 95.4 per cent credibility intervals for the model
parameters. The two panels refer to the different sets of mass
and redshift bins defined in Table 1 (current data on the
left, future data on the right). Different sets of curves in
the same panel refer to the different fiducial models used
to generate the initial conditions of the simulations. In all
cases, the fiducial values for $f_{\rm NL}$ and $g_{\rm NL}$, indicated by the
black crosses, lie within the 95.4 per cent credibility
region. Note that the current datasets are not expected to
give tight constraints on the model parameters, which are
also degenerate. The credibility intervals shrink when more
mass/redshift bins are used. The small rotation of the
contour levels in the right panel is caused by including data
from $z = 2$, which significantly improves the constraints on
$g_{\rm NL}$ and only slightly weakens the constraints on $f_{\rm NL}$ (with
respect to the current dataset).
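
The procedure of this subsection (Gaussian likelihood, flat priors on a finite grid, marginalization over $b_1$) can be sketched for a single toy tracer as follows. Everything specific here is an assumption for illustration: the value `BETA_G`, the error `sigma_b`, the noise-free mock data, the grid ranges, and the approximation $T = D = 1$ in $\alpha(k)$. With only one tracer the marginal posterior is a narrow ridge in the ($f_{\rm NL}$, $g_{\rm NL}$) plane, which is the degeneracy discussed above.

```python
import numpy as np

DELTA_C = 1.42
BETA_G = 2.0e-4                          # hypothetical beta_g for this toy tracer
H0_OVER_C = 100.0 / 299792.458           # H_0 / c in h/Mpc

def alpha(k, omega_m=0.279):
    """Equation (3) at z = 0 with T = D = 1 (crude large-scale approximation)."""
    return 2.0 * k**2 / (3.0 * omega_m * H0_OVER_C**2)

def model_bias(k, f_nl, g_nl, b1):
    """b(k) = b1 + Delta b(k), using equations (5)-(7)."""
    beta_f = 2.0 * DELTA_C * (b1 - 1.0)
    return b1 + (beta_f * f_nl + BETA_G * g_nl) / alpha(k)

k = np.linspace(0.007, 0.04, 8)          # h/Mpc
sigma_b = 0.02                           # assumed Gaussian error on each b(k)
data = model_bias(k, 50.0, 5.0e5, 2.0)   # noise-free mock data

# Flat priors: the posterior is proportional to the likelihood on the grid
f_grid = np.linspace(-200.0, 300.0, 51)
g_grid = np.linspace(-2.0e6, 2.0e6, 41)
b_grid = np.linspace(1.8, 2.2, 81)
F, G, B = np.meshgrid(f_grid, g_grid, b_grid, indexing="ij")

resid = model_bias(k, F[..., None], G[..., None], B[..., None]) - data
chi2 = np.sum((resid / sigma_b) ** 2, axis=-1)
post3d = np.exp(-0.5 * (chi2 - chi2.min()))

post2d = post3d.sum(axis=-1)             # marginalize over b1
post2d /= post2d.sum()                   # normalise on the grid

i_f, i_g = np.unravel_index(np.argmax(post2d), post2d.shape)
print("grid maximum at f_NL =", f_grid[i_f], ", g_NL =", g_grid[i_g])
```

Combining several tracers amounts to adding their $\chi^2$ contributions (each with its own $b_1$, $\beta_f$ and $\beta_g$) before exponentiating, which shortens the ridge.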



4.2 One-parameter models for PNG 

We now consider two simpler models by assuming that PNG
is either purely quadratic ($g_{\rm NL} = 0$, $M_{1f}$) or purely cubic
($f_{\rm NL} = 0$, $M_{1g}$). Everything else is left unchanged with respect to
$M_2$, so that $M_2$, $M_{1f}$ and $M_{1g}$ form a set of nested models.

The marginal posterior probability distribution for $f_{\rm NL}$
obtained combining the current dataset with model $M_{1f}$ is
presented in Fig. 2. The left panel shows the result for the
simulation with positive $g_{\rm NL}$ and the right panel for negative
$g_{\rm NL}$. Different line styles indicate the individual redshift
bins: dotted, long-dashed and dot-dashed for $z = \{0.5, 1, 1.5\}$,
while the solid curve shows the combined probability for all
redshifts. For the simulation with $g_{\rm NL} = 5 \times 10^5$, the resulting
95.4 per cent credibility region (lower horizontal solid line)
marginally includes the input value $f_{\rm NL} = 50$, indicated by
the vertical dotted line, but the maximum is shifted towards
higher values. When $f_{\rm NL}$ and $g_{\rm NL}$ have opposite signs, the
maximum a posteriori value is biased low, and the input
value is excluded at high significance. In both cases, there
is an apparent redshift dependence of $f_{\rm NL}$, and this trend
depends on the sign of $g_{\rm NL}$. Thus, if a redshift dependence
were ever detected in observations, it might just indicate
that the wrong model for PNG has been assumed. We note that
the results in Xia et al. (2011) actually hint at a redshift
dependence of $f_{\rm NL}$, although the error bars are quite large.
If taken at face value, the ordering of the estimates would
suggest that $g_{\rm NL} < 0$.

Figure 2. Constraints on $f_{\rm NL}$ for model $M_{1f}$ (where $g_{\rm NL} = 0$ is assumed a priori). Left: simulation with $g_{\rm NL} = 5 \times 10^5$; right: simulation
with $g_{\rm NL} = -5 \times 10^5$. Constraints from the 3 redshift bins in the current sample are shown with different line styles: z = 0.5 dotted, z = 1
long-dashed, z = 1.5 dot-dashed. The posterior pdf obtained combining the 3 redshift bins is shown with a solid line. For comparison,
we also show the combined posterior for the future dataset (short-dashed). The two horizontal lines indicate the 68.3 and 95.4 per cent
credibility intervals for each of the combined posteriors. The vertical dotted line shows the input value $f_{\rm NL} = 50$.

Additionally, Fig. 2 includes the combined probability
for the 5 mass and redshift bins of the future dataset for
comparison (short-dashed). The shift away from the input
value is qualitatively the same, but larger for the future
dataset. This can be understood by comparing the degeneracy
between $f_{\rm NL}$ and $g_{\rm NL}$ in the two panels of Fig. 1.

The marginal posterior probability distribution for $g_{\rm NL}$
under model $M_{1g}$ is shown in Fig. 3. As in the previous
figure, the two panels show the results for the simulations with
$g_{\rm NL} = \pm 5 \times 10^5$. In both cases, the maximum a posteriori
estimate is biased high, and the input value lies well outside the
95.4 per cent credibility regions. The estimates from the
different redshifts are almost incompatible with each other and,
unlike in model $M_{1f}$, there is no apparent trend with $z$. For
the case of negative $g_{\rm NL}$, the estimation procedure favours a
Gaussian model although both input PNG parameters are
substantially different from 0. The combined probability of
the future dataset is shown by the short-dashed curve. Here,
the shift away from the input value for $g_{\rm NL}$ is smaller, and
the constraints are tighter than for the current dataset, in
accordance with the rotation of the contours in Fig. 1.
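
The sign and redshift dependence of these estimation biases can be understood from equation (5) with a simple linear-response argument: for a single tracer with $b_1$ held fixed, a one-parameter fit absorbs the full amplitude $\beta_f f_{\rm NL} + \beta_g g_{\rm NL}$ into the only free PNG parameter. The sketch below evaluates the resulting effective parameters. The ($\beta_f$, $\beta_g$) pairs are hypothetical numbers chosen for illustration, not values from Table 1, and the actual combined estimates are weighted averages over bins with $b_1$ also being fitted.

```python
def effective_f_nl(f_nl, g_nl, beta_f, beta_g):
    """f_NL recovered when g_NL = 0 is imposed (single tracer, b1 fixed)."""
    return f_nl + beta_g * g_nl / beta_f

def effective_g_nl(f_nl, g_nl, beta_f, beta_g):
    """g_NL recovered when f_NL = 0 is imposed (single tracer, b1 fixed)."""
    return g_nl + beta_f * f_nl / beta_g

# Hypothetical (beta_f, beta_g) per redshift bin; NOT the values of this Letter.
bins = {0.5: (2.0, 0.5e-4), 1.0: (3.0, 1.5e-4), 1.5: (4.0, 3.0e-4)}

for z, (bf, bg) in bins.items():
    up = effective_f_nl(50.0, +5.0e5, bf, bg)
    down = effective_f_nl(50.0, -5.0e5, bf, bg)
    print(f"z = {z}: f_NL(eff) = {up:.1f} for g_NL > 0, {down:.1f} for g_NL < 0")
```

Since the ratio $\beta_g/\beta_f$ changes from bin to bin, the effective $f_{\rm NL}$ drifts with redshift in a direction set by the sign of $g_{\rm NL}$, and for positive $f_{\rm NL}$, $\beta_f$ and $\beta_g$ the effective $g_{\rm NL}$ is always shifted upwards, reaching zero when the two contributions cancel.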

4.3 Model selection 

Our results show that the choice of the underlying model
for PNG has a significant effect on the estimates of $f_{\rm NL}$
and $g_{\rm NL}$. Only model $M_2$ returned estimates which were in
agreement with the fiducial values, while setting a parameter
to zero a priori (as often done in the literature) produced
biased results. Here we investigate if the data themselves
can be used to establish which parameterisation should be
preferred.

Given a set of data $D$, the relative evidence in favour
of a model with respect to a second one can be quantified
in terms of the Bayes factor (e.g. Kass 1993; Trotta 2008)

$$B_{ab} = \frac{P(D|M_a)}{P(D|M_b)} = \frac{\int \mathcal{L}_a(D|M_a, \theta_a)\,\pi_a(\theta_a)\,{\rm d}\theta_a}{\int \mathcal{L}_b(D|M_b, \theta_b)\,\pi_b(\theta_b)\,{\rm d}\theta_b} \qquad (9)$$

Here $\mathcal{L}_i(D|M_i, \theta_i)$ gives the probability of getting the data
$D$ given the model $M_i$ with parameters $\theta_i$ (i.e. $\mathcal{L}_i$ is
proportional to the likelihood function for the model parameters),
$\pi_i(\theta_i)$ is the prior probability density for the model
parameters and $P(D|M_i)$ denotes the Bayesian evidence for model
$M_i$ (i.e. the probability of getting the data under model $M_i$
after marginalizing over the values of the model parameters).

We find that $M_2$ is strongly favoured over $M_{1f}$ and $M_{1g}$,
with a Bayes factor > 30, except for one case^2. Based on the
Jeffreys scale (Jeffreys 1961), this provides "very strong"
evidence in favour of $M_2$. We thus recommend the use of Bayes
factors in future determinations of PNG from observational
data based on galaxy 2-point statistics.
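
The evidence integrals entering the Bayes factor can be approximated by direct summation over a parameter grid when the number of parameters is small. The following sketch does this for a toy linear model with three tracer bins; the response vectors `resp_f` and `resp_g`, the error `sigma`, the prior ranges and the rescaled units of the second parameter are all assumptions made for illustration, and the likelihood normalisation common to all models is dropped because it cancels in the ratio.

```python
import numpy as np

def log_evidence(chi2, cell_volume, prior_volume):
    """ln of the flat-prior evidence integral, up to a model-independent constant."""
    chi2 = np.asarray(chi2, dtype=float)
    m = chi2.min()                                   # subtract the minimum to avoid underflow
    integral = np.sum(np.exp(-0.5 * (chi2 - m))) * cell_volume
    return -0.5 * m + np.log(integral / prior_volume)

# Toy data: three tracer bins responding differently to the two PNG parameters
resp_f = np.array([1.0, 1.0, 1.0])                   # hypothetical response to f
resp_g = np.array([0.5, 1.0, 2.0])                   # hypothetical response to g (rescaled units)
sigma = 1.0                                          # assumed Gaussian error per bin
data = resp_f * 50.0 + resp_g * 5.0                  # noise-free mock data, both parameters non-zero

def chi2(f, g):
    f = np.asarray(f, dtype=float)[..., None]
    g = np.asarray(g, dtype=float)[..., None]
    return np.sum(((data - resp_f * f - resp_g * g) / sigma) ** 2, axis=-1)

f_grid = np.linspace(-200.0, 300.0, 1001)            # flat prior range for f
g_grid = np.linspace(-100.0, 100.0, 1001)            # flat prior range for g
df, dg = f_grid[1] - f_grid[0], g_grid[1] - g_grid[0]
wf, wg = f_grid[-1] - f_grid[0], g_grid[-1] - g_grid[0]

F, G = np.meshgrid(f_grid, g_grid, indexing="ij")
ln_z2 = log_evidence(chi2(F, G), df * dg, wf * wg)   # two-parameter model
ln_z1f = log_evidence(chi2(f_grid, 0.0), df, wf)     # purely quadratic analogue (g = 0)
ln_z1g = log_evidence(chi2(0.0, g_grid), dg, wg)     # purely cubic analogue (f = 0)

print("ln B(M2 : M1f) =", ln_z2 - ln_z1f)
print("ln B(M2 : M1g) =", ln_z2 - ln_z1g)
```

The two-parameter model pays an Occam penalty through its larger prior volume, yet in this toy case both logarithmic Bayes factors still exceed $\ln 30$ because the one-parameter models cannot fit the three bins simultaneously.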



5 CONCLUSION 

Using two sets of N-body simulations starting from
non-Gaussian initial conditions of the local type with
$\{f_{\rm NL}, g_{\rm NL}\} = \{50, \pm 5 \times 10^5\}$, we have investigated the
bias of dark-matter haloes on large scales. Motivated by the
widespread practice of considering only one-parameter models
for PNG to fit observational data, we have studied the
effect of model assumptions on the determination of model
parameters based on 2-point statistics of the galaxy distribution.
We have considered two sets of mass and redshift bins:
the first one is chosen to emulate the current observational
samples, while the second approximates data from future
cluster surveys. Our main conclusions can be summarized
as follows:

^2 Using the current sample, $M_{1f}$ is only disfavoured by a factor
of 9 (3) for the simulation with positive (negative) $g_{\rm NL}$.

Figure 3. Constraints on $g_{\rm NL}$ for model $M_{1g}$ (where $f_{\rm NL} = 0$ is assumed a priori). Left panel: simulation with $g_{\rm NL} = 5 \times 10^5$; right panel:
simulation with $g_{\rm NL} = -5 \times 10^5$. Line styles are the same as in Fig. 2. The vertical dotted line shows the input value $g_{\rm NL} = \pm 5 \times 10^5$.
Note that the long-dashed curve (z = 1) seems to be missing in the left panel because the corresponding probability density is too low.



• Fitting the mock data with the input (two-parameter)
PNG model gives unbiased results although estimates for
$f_{\rm NL}$ and $g_{\rm NL}$ turn out to be degenerate. This degeneracy
can be at least partially broken using higher-order statistics
of galaxy clustering, like the bispectrum.

• If we assume a purely quadratic model ($g_{\rm NL} = 0$), the
maximum a posteriori estimate for $f_{\rm NL}$ is biased high (low)
for the simulation with $g_{\rm NL} = 5 \times 10^5$ ($g_{\rm NL} = -5 \times 10^5$).
The input value of $f_{\rm NL} = 50$ is almost always
excluded at very high confidence. If we consider each redshift
bin separately, a spurious systematic shift of the $f_{\rm NL}$
estimate with $z$ appears, which also depends on the sign of
$g_{\rm NL}$. If seen in observations, this effect could actually
indicate the necessity for a more complex model.

• If we assume a purely cubic model ($f_{\rm NL} = 0$), the
maximum a posteriori estimates for $g_{\rm NL}$ are artificially shifted to
larger values, which differ from the input value by much more
than the width of the posterior distribution. When the
fiducial PNG parameters have opposite signs, the two
contributions to the scale-dependent bias can cancel out, leading
to a false null detection.

• Our analysis could be extended by considering even
higher-order terms in the expansion of equation (1). However,
this phenomenological approach adds degrees of freedom
which are not necessarily independent because they
may be related to each other through the underlying physics.
Alternatively, one could test specific inflationary models and
put direct constraints on their properties (e.g. coupling
constants) that determine the PNG parameters.

• Given a dataset, statistical model selection techniques
are capable of identifying the optimal PNG parameterization
and thus of avoiding estimation biases due to incorrect
simplifying assumptions. The Bayes factor very strongly favours
the two-parameter model over a purely quadratic or purely
cubic model for our simulations. It is thus important to
apply this model selection to future constraints on PNG from
observational data.